Segmentation of Persian Cursive Words Using Basic Shapes
نویسندگان
چکیده
Segmentation is a process of dividing cursive words into smaller parts in order to decrease complexity and increase accuracy of handwriting recognition process. However it is a complicated and timeconsuming task. In this paper, we introduce the concepts of basic shapes and explore its application for segmentation of Persian words. Considering a set of pre-defined shapes include line and open or closed curve extracted from Persian alphabets, our approach will employ those shapes with decision tree technique to divide a cursive word into segments in a less complicated process. Experimental results showed 98.83% accuracy in segmenting Persian words.
منابع مشابه
A New Approach to Segmentation of Persian Cursive Script based on Adjustment the Fragments
Optical Character Recognition (OCR) is a very old and of great interest in pattern recognition field. The recognition of cursive scripts like Persian and Arabic languages is a difficult task as their segmentation suffers from serious problems in different languages. Segmentation is a process of dividing cursive words into smaller parts in order to decrease complexity and increase accuracy of re...
متن کاملA New Segmentation Algorithm for Online Handwritten Word Recognition in Persian Script
The cursive nature of Persian alphabet, and the complex and convoluted rules regarding this script cause major challenges to segmentation as well as recognition of Persian words. We propose a new segmentation algorithm for the main stroke of online Persian handwritten words. Using this segmentation, we present a perturbation method which is used to generate artificial samples from handwritten w...
متن کاملRobust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach
The presence of a large number of unique shapes called ligatures in cursive languages, along with variations due to scaling, orientation and location provides one of the most challenging pattern recognition problems. Recognition of the large number of ligatures is often a complicated task in oriental languages such as Pashto, Urdu, Persian and Arabic. Research on cursive script recognition ofte...
متن کاملA Dynamic Programming Method for Segmentation of Online Cursive Uyghur Handwritten Words into Basic Recognizable Units
Correct and efficient segmentation of Uyghur words into characters is crucial to the successful recognition. However, little work has been done in this area. There are many connected characters in cursive Uyghur handwriting, which makes the segmentation and recognition of Uyghur words very difficult. To enable large vocabulary Uyghur word recognition using character models, we propose a charact...
متن کاملA Robust Free Size OCR for Omni-Font Persian/Arabic Printed Document Using Combined MLP/SVM
Optical character recognition of cursive scripts present a number of challenging problems in both segmentation and recognition processes and this attracts many researches in the field of machine learning. This paper presents a novel approach based on a combination of MLP and SVM to design a trainable OCR for Persian/Arabic cursive documents. The implementation results on a comprehensive databas...
متن کامل